Personal audio assistant device and method

ABSTRACT

A personal audio assistant includes a first microphone for capturing audio, a communication module communicatively coupled to the first microphone and for accessing information from a network, a memory storage unit communicatively coupled to the first microphone, and a voice controlled interface coupled to the microphone. The voice controlled interface is configured to access the network in response to detection of at least one or more voice commands from the captured audio and present results in response to accessing the network to a speaker via a text-to-speech synthesizer. Other embodiments are disclosed.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the priority benefit of U.S. ProvisionalApplication No. 60/806,769, under 35 U.S.C. §119(e), filed 8 Jul. 2006,and U.S. Non-Provisional application Ser. No. 11/774,965 filed 9 Jul.2007, both of which are incorporated herein by reference in theirentireties.

FIELD

The claimed embodiments relate in general to methods and devices for thestorage and recall of audio content via an earpiece, and in particular,though not exclusively, for the storage and playing of music or verbalcontent on a system that is built into a headphone.

BACKGROUND

Present audio content playing devices are separated from the headphonesystem that normally contains the speakers (also referred to asreceivers). The reason for this has typically been that audio contenthas been stored on disks that require a separate playing system.However, even with the advent of storing audio content on non-disk RAM(Random Access Memory) storage systems, the audio content player hasbeen separated from the earpiece system (e.g., plug in headphones orearbuds). Combining the capacity for audio download and playing in anearpiece system is not obvious over related art since the userinteraction system (e.g., play button, keyboard system) does not readilyappear compatible with the size of an earpiece device and the difficultyof user interaction.

Additionally, no system currently exists for registration and downloadof audio content into an earpiece.

BRIEF DESCRIPTION OF THE DRAWINGS

Examples of the claimed embodiments will become apparent from thefollowing detailed description, taken in conjunction with the drawingsin which:

FIG. 1 illustrates the connection between an earpiece device (103 and104) and a communication network;

FIG. 2 illustrates at least one exemplary embodiment where earpiecedevices share information with other earpiece devices within range(e.g., GPS location and identity);

FIG. 3 illustrates an example of various elements that can be part of anearpiece device in accordance with at least one exemplary embodiment;

FIG. 4 illustrates an example of a communication system in accordancewith at least one exemplary embodiment that a user can use to registervia his/her computer;

FIG. 5A illustrates an earpiece that can store and download audiocontent in accordance with at least one exemplary embodiment;

FIG. 5B illustrates a block diagram of the earpiece of FIG. 5A; and

FIG. 6 illustrates a user interface for setting the parameters of thePersonal Audio Assistant.

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS

The following description of exemplary embodiment(s) is merelyillustrative in nature and is in no way intended to limit the claimedembodiments, its application, or uses.

Processes, methods, materials and devices known by one of ordinary skillin the relevant arts can not be discussed in detail but are intended tobe part of the enabling discussion where appropriate for example thegeneration and use of transfer functions.

Notice that similar reference numerals and letters refer to similaritems in the following figures, and thus once an item is defined in onefigure, it can not be discussed for following figures.

Note that herein when referring to correcting or corrections of an error(e.g., noise), a reduction of the error and/or a correction of the erroris intended.

SUMMARY OF EXEMPLARY EMBODIMENTS

At least one exemplary embodiment is directed to a system forPersonalized Services delivered to a Personal Audio Assistantincorporated within an earpiece (e.g., earbuds, headphones).Personalized Services include content such as music files (for previewor purchase) related to a user's preferences, reminders from personalscheduling software, delivery and text-to-speech, speech-to-textprocessing of email, marketing messages, delivery and text-to-speech ofstock market information, medication reminders, foreign languageinstruction, academic instruction, time and date information,speech-to-speech delivery, instructions from a GPS system and others. APersonal Audio Assistant can be an audio playback platform for providingthe user with Personalized Services.

At least one exemplary embodiment is directed to a Personal AudioAssistant system that is included as part of an earpiece (e.g.,Headphone system). The Personal Audio Assistant is capable of digitalaudio playback, mitigating the need to carry a personal music player.Furthermore, a subscription-based service provides audio content to theuser through the Personal Audio Assistant. The type of audio content,which is automatically provided to the user, is based on the user'spreferences, which are obtained through a registration process.

The audio content, which is seamlessly downloaded to the Personal AudioAssistant in the background, is managed from a Server system and is onlyavailable on the Personal Audio Assistant for a predetermined period oftime or for a fixed number of playback counts. However, the user canpurchase any music file or electronic book directly from the PersonalAudio Assistant with a simple one-click control interface, storing thepurchased audio content on the Personal Audio Assistant as well asstoring the content permanently in a user storage lock-box location onthe Server system.

The system provides for audio content to be new and “fresh” each timethe user auditions the content. As such, the content is typicallyauditioned in a first-in:first-out scenario. In one such example, theuser has turned on the Personal Audio Assistant at 8:00 am and by 10:00am has auditioned 2 hours of content that were created for the user as amanifestation of the user's choices of their preferences of genre,artist, their demographics, day of the week, time of day and purchasehistory. The system also provides for the elimination of a particularsong or playlist in situ.

As the user's Listening History Envelope is updated based on experience,subsequent downloads will only contain content incorporating theserevised preferences. The Personal Audio Assistant provides for amplememory, thus permitting hours of uninterrupted playback without the needto download additional content from the server. When in need, thePersonal Audio Assistant automatically interrogates variouscommunication platforms as it searches for connections. Once aconnection is made, the Listener History Envelope file is uploaded tothe server, and a new set of personalized playlist content is downloadedto the Personal Audio Assistant. Accordingly, as the Personal AudioAssistant content is auditioned and thus depleted, the communicationssystem provides for constant replenishment.

In another embodiment, the Personal Audio Assistant also provides for anew set of business solutions to be offered to the music industry. Asthe personalized audio content is only available for audition for alimited period of time, and can not be sent to the user again from forweeks to months, the user's purchasing behavior can be demonstrated asspontaneous. The basic model of “Try before you buy” is the expectedoutcome. In another iteration, the distributor of the music can chooseto offer discounts, which can be time-sensitive or quantity-sensitive innature, in effect promoting greater purchase activity from the user.

In another iteration, while in audition a user can wish to place thedesired content in a hold status. The hold status forms the basis of a“wish list,” thus allowing the user to hold for future considerationaudio content while it is being auditioned. This content resides in thememory of the Personal Audio Assistant for a defined period, and isautomatically erased, or the user can do so manually. The selectedcontent will also appear on the user's computer via a URL address; hereit resides on the server ready for audition or purchase and download.

The system is designed to operate as simply as possible. Using a singlebutton, which has multiple contacts, the interface allows the user topurchase, delete, skip to next, and add to a wish list and even controla listening level.

In another iteration, the user can download their own music to thePersonal Audio Assistant for audition. The Personal Audio Assistantsystem is capable of text-to-speech processing and can interface withpersonal scheduling software to provide auditory schedule reminders forthe user. Auditory reminders relating to the user's medication scheduleare also generated by the system.

At least one exemplary embodiment includes input Acoustic Transducers(microphones) for capturing user's speech as well as EnvironmentalAudio. In further embodiments, stereo input Acoustic Transducers captureEnvironmental Audio, and, mixing it with the audio signal path, presentthe ambient sound field to the user, mitigating the need to remove theHeadphone apparatus for normal conversation.

Additional exemplary embodiments are directed to various scenarios forthe delivery and consumption of audio content. The Personal AudioAssistant can store and play back audio content in compressed digitalaudio formats. In one embodiment, the storage memory of the PersonalAudio Assistant is completely closed to the end-user and controlled fromthe Server. This allows for audio content to be distributed on atemporary basis, as part of a subscription service. In another iterationof the present embodiments, the storage memory of the Personal AudioAssistant is not completely closed to the end-user, allowing the user totransfer audio content to the Personal Audio Assistant from any capabledevice such as a Personal Computer or a Personal Music Player.

In at least one exemplary embodiment the Personal Audio Assistantautomatically scans for other Bluetoot®-enabled audio playback systemsand notifies the user that additional devices are available. Theseadditional devices can include a Bluetooth® video system, televisionsystem, personal video player, video camera, cell phone, anotherPersonal Audio Assistant and others.

In another iteration, the Personal Audio Assistant can be directlyconnected to a Terrestrial Radio receiver, or have such a receiver builtin to the system.

In another exemplary embodiment, a technique known as Sonification canbe used to convey statistical or other numerical information to aheadphone. For example, the user would be able to receive informationabout the growth or decline of a particular stock, groups of stocks oreven sectors of the markets though the Personal Audio Assistant. Manydifferent components can be altered to change the user's perception ofthe sound, and in turn, their perception of the underlying informationbeing portrayed. An increase or decrease in some level of share price ortrading levels can be presented to the user. A stock market price can beportrayed by an increase in the frequency of a sine tone as the stockprice rose, and a decline in frequency as it fell. To allow the user todetermine that more than one stock was being portrayed, differenttimbres and spatial locations might be used for the different stocks, orthey can be played to the user from different points in space, forexample, through different sides of their headphones. The user can actupon this auditory information and use the controls built-in to theheadphone to either purchase or sell a particular stock position.

Furthermore, specific sonification techniques and preferences can bepresented to the user as “themes” from which the user can select. Forexample, one theme might auralize the current trading price of one stockwith an ambient sine tone in the left ear, the price of another stock inthe right ear, their respective trade volumes as perceived elevationusing personalized head-related transfer function binauralization, andthe current global index or other market indicator as the combinedperceptual loudness of both tones. Such a scheme affords ambientauditory display in this example of five dimensions of financial datawithout compromising the user's ability to converse or work on othertasks. In another embodiment, the system affords users the ability tocustomize themes to their liking and to rapidly switch among them usingsimple speech commands. Additionally, the user can search the web fromvoice commands and receive results via a text to speech synthesizer.

In yet another exemplary embodiment the Personal Audio Assistant (PAA)functions as a dictation device for medical professionals for dictatingclinical information to a patient's medical record, or writeprescriptions for medication or devices. Conversely, the PAA canfunction as text-to-speech allowing the clinician to auditioninformation from a medical record, rather than reading. The PAA can saveconsiderable time preparing clinician interaction with a patient.

In another iteration, the Personal Audio Assistant can function as atool to locate other users of Personal Audio Assistant who share commoninterests, or who are searching for particular attributes of otherusers. Whereas the first user has stored specific personal informationin the Public Data memory of the Personal Audio Assistant, an example ofwhich might be related to schools attended, marital status, professionetc, or the first user can be in search of another user with theseattributes and whereas a second user of a Personal Audio Assistant comeswithin communication range of the first user, the individual PersonalAudio Assistants communicate with each other, access the personalinformation stored in each of their respective Public Data memories toascertain if these users have common interests. If a match occurs, eachunit can contain both audible and visual indicators announcing that amatch has been made and thus each user can start dialog eitherphysically or electronically via the environmental microphones.

EXAMPLES OF TERMINOLOGY

Note that the following non-limiting examples of terminology are solelyintended to aid in understanding various exemplary embodiments and arenot intended to be restrictive of the meaning of terms nor allinclusive.

Acoustic Isolation Cushion: An “Acoustic Isolation Cushion” shall bedefined as a circum-aural or intra-aural device that provides acousticisolation from Environmental Noise. Acoustic Isolation Cushions can beincluded as part of a Headphones system, allowing the output of theacoustical transducers to reach the ear unimpeded, but still providingacoustic isolation from Environmental Noise.

Acoustic Transducer: An “Acoustic Transducer” shall be defined as adevice that converts sound pressure level variations into electronicvoltages or vice versa. Acoustic Transducers include microphones,loudspeakers, Headphones, and other devices.

Audio Playback: “Audio Playback” shall be defined as the auditorystimuli generated when Playback Hardware reproduces audio content(music, speech, etc) for a listener or a group of listeners listening toHeadphones.

Audition: “Audition” shall be defined as the process of detecting soundstimulus using the human auditory system. This includes the physical,psychophysical, psychoacoustic, and cognitive processes associated withthe perception of acoustic stimuli.

Client: A “Client” shall be defined as a system that communicates with aServer, usually over a communications network, and directly interfaceswith a user. Examples of Client systems include personal computers andmobile phones.

Communications Port: A Communication Port shall be defined as aninterface port supporting bidirectional transmission protocols (TCP/IP,USB, IEEE 1394, IEEE 802.11, Bluetooth®, A2DP, GSM, CDMA, or others) viaa communications network (e.g., the Internet, cellular networks).

Control Data: “Control Data” shall be defined as information thatdictates the operating parameters for a system or a set of systems.

Earcon: An Earcon shall be defined as a Personalized Audio signal thatinforms the User of a pending event typically inserted in advance of theupcoming audio content.

Ear Mold Style: “Ear Mold Style” shall be defined as a description ofthe form factor for an intra-aural device (e.g., hearing aids). Ear MoldStyles include completely in the canal (CIC), in the canal (ITC), in theear (ITE), and behind the ear (BTE).

Environmental Audio: “Environmental Audio” shall be defined as auditorystimuli of interest to the user in the environment where the user ispresent. Environmental Audio includes speech and music in theenvironment.

Environmental Noise: “Environmental Noise” shall be defined as theauditory stimuli inherent to a particular environment where the user ispresent and which the user does not wish to audition. The drone ofhighway traffic is a common example of Environmental Noise. Note thatEnvironmental Noise and Audio Playback are two distinct types ofauditory stimuli. Environmental Noise does not typically include Musicor other audio content.

E-Tailing System: An “E-tailing System” shall be defined as a web-basedsolution through which a user can search, preview and acquire someavailable product or service. Short for “electronic retailing,”E-tailing is the offering of retail goods or services on the Internet.Used in Internet discussions as early as 1995, the term E-tailing seemsan almost inevitable addition to e-mail, e-business, and e-commerce.E-tailing is synonymous with business-to-consumer (B2C) transactions.Accordingly, the user can be required to register by submitting personalinformation, and the user can be required to provide payment in the formof Currency or other consideration in exchange for the product orservice. Optionally, a sponsor can bear the cost of compensating theE-tailer, while the user would receive the product or service.

Generic HRTF: A “Generic HRTF” shall be defined as a set of HRTF datathat is intended for use by any Member. A Generic HRTF can provide ageneralized model of the parts of the human anatomy relevant to auditionand localization, or simply a model of the anatomy of an individualother than the Member. The application of Generic HRTF data to AudioContent provides the least convincing Spatial Image for the Member,relative to Semi-Personalized and Personalized HRTF data. Generic HRTFdata is generally retrieved from publicly available databases such asthe CIPIC HRTF database.

Headphones: “Headphones” (also known as earphones, earbuds,stereophones, headsets, Canalphones, or the slang term “cans”) are apair of transducers that receive an electrical signal from a mediaplayer, communication receivers and transceivers, and use speakersplaced in close proximity to the ears (hence the name earphone) toconvert the signal into audible sound waves. Headphones are intended aspersonal listening devices that are placed either circum-aural orintra-aural according to one of the Ear Mold Styles, as well as otherdevices that meet the above definition such as advanced eyewear thatincludes Acoustical Transducers (i.e. Dataview). Headphones can alsoinclude stereo input Acoustic Transducers (microphones) included as partof the Ear Mold Style form factor.

HRTF: “HRTF” is an acronym for head-related transfer function—a set ofdata that describes the acoustical reflection characteristics of anindividual's anatomy relevant to audition. Although in practice they aredistinct (but directly related), this definition of HRTF encompasses thehead-related impulse response (HRIR) or any other set of data thatdescribes some aspects of an individual's anatomy relevant to audition.

Informed Consent: “Informed Consent” shall be defined as a legalcondition whereby a person can provide formal consent based upon anappreciation and understanding of the facts and implications associatedwith a specific action. For minors or individuals without completepossession of their faculties, Informed Consent includes the formalconsent of a parent or guardian.

Listening History Envelope: “Listening History Envelope” shall bedefined as a record of a user's listening habits over time. The envelopeincludes system data, time system was turned off, time the system ispresenting content, when the system doesn't audition, systemtransducers, when the user auditions content, time stamp of contentbeing auditioned, content which is: skipped, deleted, played multipletimes, saved in the Wish List, and time between listening sessions.

Music: “Music” shall be defined as a form of expression in the medium oftime using the structures of tones and silence to create complex formsin time through construction of patterns and combinations of naturalstimuli, principally sound. Music can also be referred to as audio mediaor audio content.

Playback Hardware: Any device used to play previously recorded or livestreaming audio. Playback Hardware includes Headphones, loudspeakers,personal music players, mobile phones, and other devices.

Personal Audio Assistant: A “Personal Audio Assistant” shall be definedas a portable system capable of interfacing with a communicationsnetwork, directly or through an intermediate, to transmit and receiveaudio signals and other data.

Personal Computer: “Personal Computer” shall be defined as any piece ofhardware that is an open system capable of compiling, linking, andexecuting a programming language (such as C/C++, Java™, etc.).

Personal Music Player: “Personal Music Player” shall be defined as anyportable device that implements perceptual audio decoder technology butis a closed system in that users are not generally allowed or able towrite software for the device.

Personalized HRTF: A “Personalized HRTF” shall be defined as a set ofHRTF data that is measured for a specific Member and unique to thatMember. The application of Personalized HRTF data to Audio Contentcreates, by far, the most convincing Spatial Image for the Member(Begault et. al. 2001, D. Zotkin, R. Duraiswami, and L. Davis 2002).

Personalized Services: “Personalized Services” shall be defined asservices customized to better meet the needs of an individual.Personalized Services include media content (for preview or purchase)related to a user's preferences, reminders from personal schedulingsoftware, delivery and text-to-speech processing of email, marketingmessages, delivery and text-to-speech of stock market information,medication reminders, foreign language instruction [real-time foreignlanguage translation], academic instruction, time and date information,and others.

Public Data: “Public Data” shall be defined as data which containsspecific and personal information about the registered user of thePersonal Audio Assistant. The registered user chooses which portions oftheir complete Registration Process data they wish to include in thissubset. This data becomes distributed to other users who have compliantdevices thus allowing other users to know specific details of theregistered user.

Registration Process: “Registration Process” includes the acquisition ofthe user's preference via a web page. Typically, the process wouldinclude the items to be captured: Age, demographics, email, gender,Relative Audiogram, Personal Preferences, banking information, creditcard information, wake-up and sleep times, music preferences by genre,artist, preferences for writers and authors, desire to receiveadvertising, turn-on listening level, equalization, email preferences,parental control setup as well as other user-controlled settings.

Relative Audiogram: A “Relative Audiogram” shall be defined as ameasured set of data describing a specific individual's hearingthreshold level as a function of frequency. A Relative Audiogram is onlyan approximate Audiogram, leaving more complete Audiogram analysis toqualified audiologists.

Semi-Personalized HRTF: A “Semi-Personalized HRTF” shall be defined as aset of HRTF data that is selected from a database of known HRTF data asthe “best-fit” for a specific user. Semi-Personalized HRTF data is notnecessarily unique to one user; however, interpolation and matchingalgorithms can be employed to modify HRTF data from the database toimprove the accuracy of a Semi-Personalized HRTF. The application ofSemi-Personalized HRTF data to Audio Content provides a Spatial Imagethat is improved compared to that of Generic HRTF data, but lesseffective than that of Personalized HRTF data. The embodiments withinspeak to a variety of methods for determining the best-fit HRTF data fora particular Member including anthropometrical measurements extractedfrom photographs and deduction.

Server: A “Server” shall be defined as a system that controls centrallyheld data and communicates with Clients.

Sonification: “Sonification” shall be defined as the use of non-speechaudio to convey information or to aurally perceptualize non-acousticdata (auralize). Due to a variety of phenomena involving humancognition, certain types of information can be better or moreefficiently conveyed using auditory means than, for example, visualmeans.

Exemplary Embodiments

FIG. 1 illustrates the connection between an earpiece device (103 and104) and a communication network (101) via communication interface 102,which can be operatively connected (via a wired or wireless connection)to a server system (100) and/or an e-mail server (105). Additionally aradio signal (e.g., satellite radio) can be input into the earpiece 500(FIG. 5B) via a communication module (e.g., Bluetooth® wireless module515).

FIG. 2 illustrates at least one exemplary embodiment where earpiecedevices share information with other earpiece devices within range(e.g., GPS location and identity). For example multiple users 202, 203,204, and 206) can send signals to each individual earpiece (e.g., 500)when in range (e.g., via a wireless connection 205) or to a mobile audiocommunications device 200 via a wireless connection (201) with eachearpiece (500). Additionally information (e.g., audio content, softwaredownload) can be sent via a client's computer 207 to each earpiece,either directly (e.g., 205), or via 200. For example audio content canbe retrieved on a user's computer and sent to the earpieces that haveauthorization to use it.

FIG. 3 illustrates an example of various elements that can be part of anearpiece device in accordance with at least one exemplary embodiment.The earpiece can include all or some of the elements illustrated in FIG.5B. For example the logic circuit 572 or the operatively connectedmemory storage device 585, can include spatial enhancement software 329,a DSP codec 330, a speech synthesis and recognition system 311, and adigital timer system 312. Additional elements can be connected to thelogic circuit 572 as needed, for example a power supply 320, a softwarecommunication interface 307 (e.g., wireless module 515) (which may beconnected to communication network 308), data port interface 306, audioinput buffers 300 connected to digital audio input 302 and/or analogaudio input 303 converted to digital via an ADC 301, environmental audioinput acoustic transducer(s) 321 converted to digital via an ADC 316,user control 324, digital audio output 328, output acoustic transducers319 (which receive signals converted to analog via a DAC 310 andamplified via amplifier 309), display system 318, communication buffers325, program memory 305, data and personal memory 315, as well as otherelectronic devices as known by one of ordinary skill in the relevantarts.

FIG. 4 illustrates an example of a communication system in accordancewith at least one exemplary embodiment that a user can use to registervia his/her computer 419, via a communication network 400 (e.g.,Internet connection) connected to many various database and registrationsystems as illustrated and labeled in FIG. 4. For example, server 401,database management system 402, audio content preview database 403,audio content database 404, playlist engine 405, user's informationdatabase 406, HRTF acquisition process module 407, HRTF database 408,lock-box server 409, registration engine 410, e-payment system 413,business-to-business module 414, e-tailing system 415, audiogramdatabase 416, audiogram engine 417 and/or e-mail server 418.

FIG. 5A illustrates an earpiece 500 having sealing section 540 in an earcanal that can store and download audio content 560 in accordance withat least one exemplary embodiment. The earpiece 500, can include a firstuser interaction element 530 (e.g., a button), that can be used to turnthe earpiece 500 on, or if on then activate an audio play command tostart playing saved audio content. The earpiece 500 can also include asecond user interaction element 550 (e.g., a slide control) that can beused for example to control the volume. The earpiece 500 can alsoinclude recharge ports 570, that can accept two wires of varying voltagethat can be inserted into the recharge ports 570 to recharge anybatteries in the earpiece 500. The earpiece 500 can include an ambientmicrophone 520 and an optional communication antenna 510, that if neededcan aid in the communication between the earpiece 500 and acommunication network.

FIG. 5B illustrates a block diagram of the earpiece of FIG. 5A,illustrating the first user interaction element 530, the ambientmicrophone (AM) 520, that can be used to pick up ambient audio content,an ear canal microphone (ECM) 590 that can pick up audio in the earcanal region, an ear canal receiver (ECR) 580 that can direct audiocontent to the ear drum, all of which can be connected operatively to alogic circuit 572. A memory storage device 585 can be operativelyconnected to the logic circuit (LC) 572, and can store data such asregistration, preference, and audio content data. The optionalcommunication antenna 510 can be connected to a communication module(e.g., wireless module 515), and can receive or transmit information 560to a communication network.

FIG. 6 illustrates a user interface for setting the parameters stored inthe memory storage device 585. For example a user can use his/hercomputer 419 to communicate with a server 401 (e.g., via a communicationnetwork 400) to start the user's registration (e.g., with an audiocontent provider). The registration information can then be transmitted600 to set the stored parameters in the memory storage device 585 of theearpiece 500. Additionally a requested (e.g., bought) audio content canbe downloaded 610 into the memory storage device 585 of the earpiece500.

At least one exemplary embodiment is directed to an earpiece comprising:an ambient microphone; an ear canal microphone; an ear canal receiver; asealing section; a logic circuit; a communication module; a memorystorage unit; and a user interaction element, where the user interactionelement is configured to send a play command to the logic circuit whenactivated by a user where the logic circuit reads registrationparameters stored on the memory storage unit and sends audio content tothe ear canal receiver according to the registration parameters.

In at least one exemplary embodiment the audio content is stored in thememory storage unit. The earpiece according to claim 2, where thecommunications module is a wireless communications module. Additionallythe earpiece can include a second user interaction element configured toalter the volume of the audio content that is emitted from the ear canalreceiver.

Upon a play command being received by the logic circuit the logiccircuit can check registration parameters stored in the memory storagedevice for example one of the registration parameters can be whether theaudio content is a sample audio content or a fully purchased audiocontent, or the allowed number of times an audio content can be played,and a counter value that keeps track of the number of times the audiocontent has been played.

The earpiece can send an auditory warning to be emitted by the ear canalreceiver when the counter value is greater than or equal to the allowednumber of times the audio content can be played, and where the logiccircuit does not send the audio content to the ear canal receiver.

Further Exemplary Embodiments

At least one exemplary embodiment is directed to a system for thedelivery of Personalized Services to Personal Audio Assistants, thesystem comprising: a Personal Audio Assistant system for presentingPersonalized Services to the user as Audio Playback; a Server system foruser registration, Personalized Service management, and communication; aRegistration Process for collecting detailed registration informationfrom users, including the information necessary for creatingPersonalized Services; a communications protocol (TCP/IP, USB, IEEE1394, IEEE 802.11, Bluetooth®, A2DP, GSM, CDMA, or other) and acommunications network (i.e. the Internet, cellular networks) connectingthe Personal Audio Assistant to the Server or connecting the PersonalAudio Assistant to other Personal Audio Assistants (peer-to-peerbehavior).

In at least one exemplary embodiment a Personal Computer acts as anintermediate, connecting to the Server system over a communicationsnetwork and connecting to the Personal Audio Assistant over a localconnection. At least one exemplary embodiment includes a PersonalHearing Damage Intervention System (e.g., USPTO—60/805,985—Goldstein).

In at least one exemplary embodiment a Personal Audio Assistant systemincluded as part of a Headphone system, the system comprising: aCommunications Port supporting a communications protocol enablingcommunication with the Server system, peer devices, and other capabledevices; a non-volatile program memory storage system for storingControl Data, dictating system behavior; a data memory storage systemfor storing data and audio content; an analog audio input/output andcorresponding ADC/DAC; a digital audio input/output and a digital audiosignal path; a user control system allowing the user to adjust the levelof the audio output and control the behavior of the system; a usercontrol system allowing the user to purchase the content beingauditioned in real time; a user control system allowing the user tocontrol, delete, fast forward, output level control, scan, advance, thedata stored both stored in memory as well as new streaming data emailsand reminders; a display system for presenting information to theuser(s) visually using any method familiar to those skilled in the art(LED, LCD, or other); a display system for presenting information to theuser(s) (e.g., using Earcons and other sound files); a speech synthesissystem for converting text-to-speech and generating speech signals; aspeech recognition system for converting speech to-text to respond andsend emails and to interface with the control language as to providenavigational commands; a digital timer system; a power supply system inthe form of a battery; a unique identification number for each PersonalAudio Assistant; Input Acoustic Transducers; an Output AcousticTransducer; an Audio amplification system; Acoustic Isolation Cushionsconforming to one of the Ear Mold Styles (CIC, ITC, ITE, or BTE; seedefinitions) and other elements common to Headphone systems; a digitalsignal processor (DSP) system; and a CODEC processor capable ofimproving the perceptual sound quality of the content to be auditionedwhile governed by delivering the correct SPL dose.

In at least one exemplary embodiment the system is independent of aHeadphone array or can be included and imbedded as part of a PersonalComputer system, a Personal Music Player system, a personal monitoringsystem, an automotive audio system, a home audio system, an avionicsaudio system, a personal video system, a mobile cell phone system, apersonal digital assistant system, a standalone accessory, or anadvanced eye-wear system with acoustical transducers.

In at least one exemplary embodiment the various processing needed toderive the intended functions are distributed among any combination of aServer system, a Personal Computer system, a Personal Music Playersystem, a personal monitoring system, an automotive audio system, a homeaudio system, an avionics audio system, a personal video system, amobile cell phone system, a personal digital assistant system, astandalone accessory, or an advanced eye-wear system with acousticaltransducers.

In at least one exemplary embodiment the Personal Audio Assistant systemcan exchange audio signals with a mobile phone via the CommunicationsPort, allowing the Personal Audio Assistant to function as a mobilephone accessory.

In at least one exemplary embodiment a communications buffer isincluded. For example when a network connection is available, thecommunications buffer uploads stored content (e.g., Listening HabitsEnvelope) and stores incoming transmissions (e.g., music, electronicbooks, and updates to the firmware or operating system) from theCommunications Port; The contents of the communications buffer are thentransmitted whenever a network connection becomes available. At leastone exemplary embodiment includes a perceptual audio codec decodingtechnology in the DSP, enabling the storage and playback of compresseddigital audio formats (e.g., MP3, AAC, FLAG, etc.). At least oneexemplary embodiment is compliant and compatible with DRM, FairPlay® andother forms of digital content governance.

At least one exemplary embodiment includes a user control system forselecting and playing back audio content stored in memory that operatesusing any combination of the following methods: a button or tactileinterface which upon auditioning a song can be pressed to order content;a button, tactile and/or voice controlled interface which, when pressedonce, commanded to, activates playback of short audio clips or audiothumbnails of the audio content stored in memory; When the button ispressed again during audio thumbnail playback, the current audio contentselection is played in its entirety; The behavior of this interface issimilar to the “scan” button interface common in FM/AM radio devices; abutton, tactile and/or voice controlled interface that, when pressed orcommanded to, skips to the next piece of audio content, which isselected randomly from all available audio content that has a play countequal to or less than the play count of the piece of audio contentcurrently playing; The behavior of this interface is similar to the“shuffle” behavior found in some personal music players; an interfacefor browsing audio content storage devices familiar to those skilled inthe art; and a process to allow for increased data memory storagecapacity for storing audio content.

In at least one exemplary embodiment the contents of the data memory areencrypted and controlled by the Server system only, prohibiting theend-user from loading unauthorized audio content into the data memory.Further the contents of the data memory can be manipulated by theend-user, allowing the user to transfer audio content to the PersonalAudio Assistant system from any device capable of interfacing with thecommunications port; For example, audio content can be transferred tothe system from a Personal Music Player or a Personal Computer.According to at least one exemplary embodiment, audio content (or othermedia content) updates are retrieved from the Server system any time aconnection is detected by the communications port. Furthermore, anexemplary embodiment can include an acoustical and/or visual indicatorinforming the user when a transfer of data is activated.

In at least one exemplary embodiment radio wave transmissions are usedto implement some communications protocol and the communications portacts as a radio receiver. Additionally the Personal Audio Assistant caninclude: interfaces with some personal scheduling software through thecommunications port; a speech synthesis system which generatesspeech-signal reminders corresponding to information from the schedulingsoftware, where the digital timer system triggers the presentation ofthe speech-signal reminders at the appropriate time.

Additionally the Personal Audio Assistant can interface with an emailplatform through the communications port; The speech synthesis systemconverts the email in text to speech and provides email to the user inaural presentation format. The system further comprising: a process inthe Registration engine allowing the user to optimize theirpersonalization process of incoming emails by associating a specificEarcon with the importance of the incoming email. As such, normalpriority email contains an introduction sound announcing to the user thelevel of importance the sender associated with their email; a speechrecognition system for converting speech-to-text which interfaces withthe control language as to provide navigational commands allowing theuser to respond and send emails.

In at least one exemplary embodiment the communications port systemmakes use of some wireless communications protocol (802.11, Bluetooth®,A2DP, or other) to transmit and receive digital audio data for playback,the system further comprising: an audio codec to encode and decodedigital audio transmissions; a wireless communications system (802.11,Bluetooth®, A2DP, etc.) for transmitting and receiving data (digitalaudio transmissions, Control Data, etc.); a method for pairing two ormore Personal Audio Assistants through a wireless communicationsprotocol to provide a secure exchange of audio content, data such as theuser's Public Data; an audio warning signal or visual display systemoutput that notifies the user anytime a compatible transmission becomesavailable; and a user control system enabling the user to switch betweenavailable compatible transmissions.

In at least one exemplary embodiment the system enables listeners toshare digital audio transmissions, the system further comprising: amethod for scanning for available digital audio transmissions withinrange; a user control interface for specifying digital audiotransmission behavior; a method for employing the system as a relay toother compliant devices; re-broadcasting digital audio transmissions toincrease wireless range. In at least one exemplary embodiment multiplesystems are capable of sharing the contents of their program and datamemory using the wireless communications protocol.

In at least one exemplary embodiment of the system, the input AcousticTransducer is used to record audio content to the data memory storagesystem, the system further comprising: an implementation of someperceptual audio codec technology in the DSP, enabling the storage ofcompressed audio formats (e.g., MP3, AAC, FLAG, etc); and an increaseddata memory storage capacity for storing recorded audio content.

In at least one exemplary embodiment, the stereo input AcousticTransducers are ultimately connected to the audio signal path at theDSP, allowing the user to audition Environmental Audio (e.g., speech ormusic) and mitigating the need for the user to remove the Headphoneapparatus to audition Environmental Audio, the system furthercomprising: a stereo pair of input Acoustic Transducers placed close tothe user's ear canal input, conforming to one of the Ear Mold Styles(CIC, ITC, ITE, or BTE, see definitions); and by mounting the inputAcoustic Transducers in a CIC or ITC configuration, spatial-acousticcues are preserved, creating a spatially-accurate Environmental Audioinput signal—essentially a personal binaural recording; a method foracoustically compensating for the non-linear frequency responsecharacteristics of the Acoustical Isolation Cushions of a givenHeadphone system by applying corresponding inverse filters to theEnvironmental Audio input signal at the DSP; With this method, thesystem acts as a linear-frequency-response hearing protection apparatus(e.g., USPTO—60/805,985—Goldstein).

At least one exemplary embodiment includes a system for firstattenuating Audio Playback and then mixing the Environmental Audio inputsignals, at a louder listening level, with the audio signal path usingthe DSP, where the system is activated by any combination of thefollowing methods: a manual switch to activate/deactivate the system; aspeech-detection apparatus to activate the system when speech isdetected as the principal component of the Environmental Audio input;and a music-detection apparatus to activate the system when music isdetected as the principal component of the Environmental Audio input.

At least one exemplary embodiment can include active noise reduction,echo cancellation and signal conditioning that can be environmentallycustomized through the registration process to better meet the user'sspecific needs (i.e., occupation-related noise cancellation); A typicalapplication would be a special set of noise cancellation parameterstuned to the drilling equipment used by a dentist.

In at least one exemplary embodiment the input Acoustic Transducers areinstead mounted within circum-aural, intra-aural BTE, or intra-aural ITEmolds (see Ear Mold Style), the system further comprising: a spatialaudio enhancement system for supplementing the spatial-acoustic cuescaptured by the stereo pair of input Acoustical Transducers to provideimproved spatial perception of Environmental Audio using any combinationof the following methods: the application of Generic, Semi-Personalized,or Personalized HRTF data to the Environmental Audio input signal; theapplication of binaural enhancement algorithms, familiar to thoseskilled in the art, to the Environmental Audio input signals; theapplication of a pinna simulation algorithm to the Environmental Audioinput signal; and a synthetic pinna apparatus placed just before thestereo input Acoustic Transducers.

At least one exemplary embodiment includes a Server system for thecreation, Registration, management, and delivery of PersonalizedServices, the system comprising: a communications system for interfacingwith public communication networks to exchange data with Personal AudioAssistants, a Client's computer, mobile phones, PDAs or other capabledevices; a database and database management system for storing andretrieving information relating to user Registration, PersonalizedServices, audio content, Control Data, and other data; a Registrationinterface system for collecting, storing, and applying informationprovided by users; a method for creating Personalized Services based onuser Registration information; an end-user audio content Lock-Boxstorage system, providing every registered user access to theirpurchased media content; a business-to-business interface system foracquiring audio content with record labels, copyright holders, and otherbusinesses; an E-tailing system including an electronic transactionssystem enabling users to purchase content, items offered for sale or paysubscription fees electronically; an E-Payment system compensating thevarious copyholders upon purchase of content by user; a Playlist engine,which acquires the user's Registration information, Listening HistoryEnvelope and then creates audio playlists, which is optimized for theuser preferences and further refinements; and an Email server, whichdistributes communications to the user and others, regarding marketingdata, the status of the user weekly SPL dose, and other information.

At least one exemplary embodiment includes machine-learning techniquesemployed to better optimize the user's preferences relating to audiocontent and other media content, the system further comprising: a methodfor tracking the purchase history of each user, relating the purchasehistory to media content preferences, and using the purchase history tomake media content recommendations; a method for examining a user'sdigital media library, stored on a Personal Computer, Personal MusicPlayer, or Personal Audio Assistant, from the Server system, andrelating media content preferences and media content recommendations tothe user's digital media library; and a method for examining a user'sListening History Profile.

At least one exemplary embodiment includes a Registration system forcollecting a wide variety of information from users, includinginformation necessary for creating Personalized Services, the systemcomprising: a Server system; an interface system for querying the userto collect registration information including demographics (age,gender), Playback Hardware information, Headphone information,occupational information, home and work locations, medicationinformation, music-related preferences, video-related preferences, andother information; a method for customizing Control Data based onregistration information; and a method for creating PersonalizedServices based on registration information.

In at least one exemplary embodiment a fast HRTF acquisition process isincluded as part of the Registration process, the system furthercomprising a method for the fast acquisition of Semi-Personalized HRTFdata via a deduction process, the method comprising: a database systemcontaining indexed, clustered HRTF data sets; an auditory test signalwith distinctive spatial characteristics, where two or more distinctsound source locations exist; a system for the application of potentialHRTF matches to the auditory test signal; and a feedback system,allowing the user to select the best listening experience from a numberof candidate listening experiences, based on the spatial qualityperceived in the HRTF-processed auditory test signal.

In at least one exemplary embodiment Personalized HRTF data is measuredand used instead of Semi-Personalized HRTF data, by any method familiarto those skilled in the art.

In at least one exemplary embodiment the user is provided some PersonalAudio Assistant free-of-charge or at a discount, given the user agreesto a subscription service commitment to receive Personalized Servicesfor a certain amount of time.

In at least one exemplary embodiment, as part of the PersonalizedServices, the user is provided with temporary audio contentcorresponding to the preferences indicated during the registrationprocess; Further, the user is given the option to purchase the audiocontent permanently; Otherwise, the audio content is replaced with newaudio content from the Server, after a predetermined amount of time or apredetermined number of playback counts, the system comprising: aPersonal Audio Assistant with an enhanced user control system, enablinga registered user to purchase media content directly from the PersonalAudio Assistant with a button; and a Personal Audio Assistant with anenhanced user control system, enabling a registered user to store areference to media content that can be purchased by the user at a latertime.

In at least one exemplary embodiment, video or gaming content isincluded as well as audio content, the system further comprising: aPersonal Audio Assistant with an enhanced visual display system, capableof playing video and/or gaming content.

In at least one exemplary embodiment, as part of the PersonalizedServices, the user receives medication reminders in the form of speechsignals, audio signals, text, or graphics on the user's Personal AudioAssistant; Medication reminders are generated by the Server system basedon the user's registration information.

In at least one exemplary embodiment, as part of the PersonalizedServices, the user receives stock market information in the form ofspeech signals, audio signals, text, or graphics on the user's PersonalAudio Assistant; The stock market information is selected by the Serversystem based on the user's registration information, the system furthercomprising: the user having successfully registered their Personal AudioAssistant with a brokerage firm, or other stock trading engines, theuser can then purchase or sell a stock by use of a user button or aspeech command.

Further in at least one exemplary embodiment, the user is able torequest specific media content to be transferred temporarily orpermanently to the user's Personal Audio Assistant, the system furthercomprising: an interface system operating on the Server allowing usersto request specific media content by artist, title, genre, format,keyword search, or other methods familiar to those skilled in the art;and a media content search engine system.

In at least one exemplary embodiment a Relative Audiogram compensationfilter is applied to audio signal path by the digital signal processor,the system either (e.g., USPTO—60/805,985—Goldstein): (a) RetrievesRelative Audiogram compensation information from a remote Server after aregistration process (during transmission, the information can includeHIPAA compliant encoding); or (b) calculates a compensation filter fromRelative Audiogram information obtained by the system locally. Forexample U.S. Pat. No. 6,840,908—Edwards, and U.S. Pat. No.6,379,314—Horn, discuss methods for the acquisition of an individual'sRelative Audiogram.

In at least one exemplary embodiment a Satellite Radiotransmitter/receiver (transceiver) is incorporated within the Headphoneproper, allowing the user to at least: receive XM®, Sirius® and otherbroadcasts for playback over the system; select radio stations forplayback over the system via the control system, the control systemcomprising either a single-click tactile interface or thespeech-controlled circuitry; store selected portions of such broadcastsin memory for later recall and playback via the control systems; engagea novel commercial-skip feature for attenuating the playback level ofsuspected sales commercials broadcasts; and engage a speech-skip featurefor attenuating the playback of speech (e.g., news, announcements,etc.).

At least one exemplary embodiment includes a Walkie-Talkie mode, whichbroadcasts input to the system's built-in microphone, whereby the user'sspeech can be detected by the input acoustic transducer and remotelybroadcast where at least one of the following occurs: the Walkie-Talkiemode receives input via AM/FM broadcasts (as well as digitalcommunications protocols) from a nearby user; the Walkie-Talkie modeallows nearby users to engage in conversation with increased perceptualclarity in noisy environments (e.g., aircraft cockpits), using forexample a noise-cancellation system; selectively engage and disengagethe Walkie-Talkie mode using the control system; detect other users ofthe system within a given range; and alert the user of the system whenother detected systems contain certain Public Data and contain apredefined Public Message Key (e.g., “If the detected system belongs toa single male between the ages of 25 and 30 and whose favorite sport istennis, then broadcast the message, ‘I like tennis also; would you liketo have coffee?’” or “If the detected system belongs to a user whoattended Princeton University, then broadcast the message, ‘GoTigers!’”).

At least one exemplary embodiment can use other communications toaccomplish this service rather than AM/FM; as such the system canincorporate communications transmission protocols (TCP/IP, USB, IEEE1394, IEEE 802.11, Bluetooth®, A2DP, GSM, CDMA, or other protocols) anda communications network (i.e. the Internet, cellular networks)connecting the Personal Audio Assistant to other Personal AudioAssistants. At least one exemplary embodiment can selectively controlthe broadcast of public data and public message keys via the controlsystem.

At least one exemplary embodiment includes a Sonification algorithmwithin the Headphone, which enables auditory display of digitallyreceived data, including for example financial data, news, GPS data, thesystem further containing a variety of sonification “themes” selectedduring the registration process that map requested data (e.g., currentaverage trading price of AAPL stock, the Dow Jones Industrial Index, andthe Nasdaq Composite®) to corresponding audio content (e.g., thefrequency of a sine tone presented in the left ear, the frequency of asine tone presented in the right ear, and the global amplitude of bothsine tones, respectively).

At least one exemplary embodiment includes an auditory display, which issynthesized by the onboard Digital Signal Processor. In at least oneexemplary embodiment the auditory display is created through the digitalaudio signal processing effects applied to any other acoustic data thesystem is capable of reproducing (e.g., terrestrial radio, prepurchasedaudio content in the user's digital library, electronic books, etc.);For example, a sudden listening level increase in the playback level ofa song to which the user was listening can be triggered by a predefinedalert condition (e.g., Nasdaq Composite° has exceeded 2200 points).

At least one exemplary embodiment includes the ability to create themesusing a computer program and uploading a file to the Headphone system.

At least one exemplary embodiment includes a speech recognition systemfor converting speech to HTML (Voice Browser), whereby the user canaccess the Internet, provide navigational commands, perform searches andreceive results via the Headphones through a text (HTML)-speechsynthesize.

Additionally, the Personal Audio Assistant can be totally incorporatedwith a mobile cell phone, or any portable technology which incorporatesany of the following protocols, TCP/IP, USB, IEEE 1394, IEEE 802.11,Bluetooth®, A2DP, GSM, CDMA, or others known to those of ordinary skillin the arts via a communications network (e.g., the Internet, cellularnetworks), the system further comprising: an Acoustic Transducerconstructed as part of the mobile cell phone or a series of AcousticTransducers, which are constructed as part of mobile cell phone; acommutations path incorporated into the mobile cell phone providing forbi-directional communication with a Headphone array; the incorporationof the mobile cell phone's microphone(s) to act as the EnvironmentalAudio Acoustical Transducer(s); and the incorporation of the mobile cellphone's keyboard or touch sensitive screen to function as a manual inputor to complement speech commands and that can act in a way to respond toPersonalized Services offered to a user.

While the present embodiments have been described with reference toexemplary embodiments, it is to be understood that the claimedembodiments are not limited to the disclosed exemplary embodiments. Thescope of the following claims is to be accorded the broadestinterpretation so as to encompass all such modifications and equivalentstructures and functions.

What is claimed is:
 1. A personal audio assistant, comprising: a firstmicrophone for capturing audio; a communication module communicativelycoupled to the first microphone and for accessing information from anetwork; a memory storage unit communicatively coupled to the firstmicrophone; and a voice controlled interface coupled to the microphone,the voice controlled interface being configured to: access the networkin response to detection of at least one or more voice commands from thecaptured audio; and present results in response to accessing the networkto a speaker via a text-to-speech synthesizer.
 2. The personal audioassistant of claim 1, wherein the communication module provides mediacontent recommendations in response to accessing the network.
 3. Thepersonal audio assistant of claim 2, wherein the voice controlledinterface is configured to perform a commercial transaction in responseto the at least one or more voice commands and wherein the communicationmodule tracks a purchase history and relates the purchase history tomedia content preferences.
 4. The personal audio assistant of claim 2,wherein the communication module searches and shares the media contentrecommendations and common interests with other devices having amatching profile.
 5. The personal audio assistant of claim 2, whereinthe network is the Internet and the communication module searches theInternet from voice commands and receive results via the text to speechsynthesizer.
 6. The personal audio assistant of claim 1, wherein thecommunication module requests specific media content to be transferredtemporarily or permanently to the memory storage unit.
 7. The personalaudio assistant of claim 1, wherein the communication module stores alistening history to the memory storage unit.
 8. The personal audioassistant of claim 1, wherein the communication module sharesinformation with earpiece devices within range based on GPS location andidentity.
 9. The personal audio assistant of claim 1, further comprisinga speech recognition system for processing from the audio and convertingspeech to text using a voice browser for access to the Internet; whereinthe communication module accesses the Internet to provide searching ofparticular attributes or common user interests from the processing ofthe audio and voice browser.
 10. The personal audio assistant of claim1, further comprising a text-to-speech processor to interface withpersonal scheduling software to provide auditory schedule reminders to auser of the personal audio assistant.
 11. The personal audio assistantof claim 10, further comprising a digital timer that triggers apresentation of auditory schedule reminders.
 12. The personal audioassistant of claim 11, wherein the text-to-speech processor generatesspeech-signal reminders corresponding to information from the personalscheduling software.
 13. The personal audio assistant of claim 1,further comprising a touch sensitive screen communicatively coupled tothe first microphone to complement detected voice commands.
 14. A methodfor providing a personal audio assistant, comprising: capturing audiofrom a first microphone; accessing information from a network inresponse to the captured audio using a voice controlled interface;storing information related to accessing the network onto a memorystorage unit; and presenting results in response to accessing thenetwork.
 15. The method of claim 14, comprising presenting the resultsaudibly using a text-to-speech synthesizer for output to a speaker. 16.The method of claim 14, comprising presenting the results on apresentation device among a display, a touch sensitive screen, or aspeaker to complement results from detected voice commands by presentingat least one among text, images, webpages, or audible outputs at thepresentation device.
 17. The method of claim 14, further comprisingdetection of a voice command from the captured audio and performing asearch of the network in response to the detected voice command.
 18. Themethod of claim 14, further comprising detection of inputs to a touchsensitive screen to complement detected voice commands.
 19. A personalaudio assistant suitable for use in eyewear, comprising: a firstmicrophone and a second microphone for stereo acoustic input andcapturing audio; a communication module for accessing information from anetwork in response to the captured audio using a voice controlledinterface communicatively coupled to the first microphone and secondmicrophone; a memory storage unit for storing information related toaccessing the network; and a speaker to present results in response toaccessing the network via a text-to-speech synthesizer.
 20. The personalaudio assistant of claim 19, wherein the speaker is a portion of anearphone and the personal audio assistant further comprising a processorfor detecting of a voice command from the captured audio and performinga search of the Internet in response to the detected voice command.